A New Experience in Persian Text Clustering using FarsNet Ontology
نویسندگان
چکیده
MOHAMMAD ZANJANI, AHMAD BARAANI DASTJERDI, EHSAN ASGARIAN, ALIREZA SHAHRIYARI AND AMIR AKHAVAN KHARAZIAN Department of Information and Communication Technology South Pars Gas Complex Asalouyeh, I.R. Iran Department of Computer Engineering, School of Engineering SheikhBahaee University Isfahan, I.R. Iran Department of Computer Engineering, Faculty of Engineering Isfahan University Isfahan, I.R. Iran Department of Computer Engineering, Faculty of Engineering Ferdowsi University of Mashhad Mashhad, I.R. Iran Department of Computer Engineering and Mathematics, Faculty of Engineering Kingston University London, KT12EE UK Email: [email protected]
منابع مشابه
Towards Semi Automatic Construction of a Lexical Ontology for Persian
Lexical ontologies and semantic lexicons are important resources in natural language processing. They are used in various tasks and applications, especially where semantic processing is evolved such as question answering, machine translation, text understanding, information retrieval and extraction, content management, text summarization, knowledge acquisition and semantic search engines. Altho...
متن کاملOntology-Based Automatic Text Summarization Using FarsNet
To summarize a text means to compress the text source into a shorter text in a way that the informational content is kept the same. With regard to the irregular volume of information available on the internet, manual summarization of huge volume of information by humans will be very arduous and difficult. There have been many activities in the field of automatic summarization so far. However, a...
متن کاملPersian Wordnet Construction using Supervised Learning
This paper presents an automated supervised method for Persian wordnet construction. Using a Persian corpus and a bi-lingual dictionary, the initial links between Persian words and Princeton WordNet synsets have been generated. These links will be discriminated later as correct or incorrect by employing seven features in a trained classification system. The whole method is just a classification...
متن کاملExtracting Lexico-conceptual Knowledge for Developing Persian WordNet
Semantic lexicons and lexical ontologies are some major resources in natural language processing. Developing such resources are time consuming tasks for which some automatic methods are proposed. This paper describes some methods used in semi-automatic development of FarsNet; a lexical ontology for the Persian language. FarsNet includes the Persian WordNet with more than 10000 synsets of nouns,...
متن کاملخوشهبندی اسناد مبتنی بر آنتولوژی و رویکرد فازی
Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Inf. Sci. Eng.
دوره 31 شماره
صفحات -
تاریخ انتشار 2015